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Abstract —  Predictable  execution  enforcement  for  applications 
with  highly  and  arbitrarily  fluctuating  resource  usage  requires 
runtime  resource  management.  Correct  runtime  predictions 
regarding  resource  usage  of  individual  components  allows 
making  proper  resource  reservations,  enabling  a  better 
resource  management  of  the  component-based  applications. 
This  work  presents  a  state-based  resource  usage  model  for  a 
component,  in  which  states  represent  CPU  utilization 
intervals.  This  resource  model  is  intended  for  a  resource- 
aware  component  framework  where  it  will  be  used  to  determine 
the  quality  of  resource  reservation.  For  this  purpose,  the  model 
offers  two  metrics:  failure  rate,  which  measures  the  fraction 
of  the  reservation  periods  for  which  the  reserved  budget  was 
insufficient,  and  resource  waste,  which  measures  unused 
budget.To  illustrate  the  model,  we  apply  it  to  a  family  of  reser- 
vation prediction  strategies  and  validate  the  outcome  by  means 
of  a  series  of  experiments  in  which  we  measure  the  resource 
utilization  of  two  video  components.  The  latter  requires  a 
method  for  monitoring  resource  states  which  is  also  presented, 
analyzed  and  validated  in  this  paper.  ' 

Index  Terms —  runtime  resource  management,  component- 
based  applications,  resource  model,  resource  usage,  resource 
prediction 

I.  Introduction 

Component-based  systems  engineering  is  a  software 
engineering  technique  that  allows  building  complex  systems 
from  predefined  building  blocks.  One  of  the  main  advantages 
of  this  technique  is  the  possibility  to  distribute  the 
components  among  different  nodes.  However,  running 
component-based  applications  on  distributed  platforms 
implies  also  disadvantages  as,  for  instance,  difficulties  in 
resource  management.  In  this  work,  we  are  interested  in  the 
management  of  applications  realized  using  a  component- 
based  software  technology  and  exhibiting  fluctuating 
resource  demands.  Examples  of  such  applications  are  found 
in  the  area  of  video  processing,  both  in  multimedia  [1]  and  in 
surveillance  applications  [2],  where  the  required  processing 
power  is  highly  dependent  on  the  video  content. In  previous 
work  [3],  David  et  al.  (2010)  sketched  an  architecture  of  a 
resource- aware  component  framework  that  targets  such 
applications  and  identified  a  number  of  resource  management 
strategies  that  can  be  applied  in  response  to  fluctuations  in 
both  resource  demand  and  resource  supply.  These  strategies 
range  from  local  adaptations  providing  a  remedy  for  demand- 
supply  mismatches  on  a  single  node  to  system-wide 
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reconfiguration  that  involves  component  migration,  or  even 
modification  of  the  application  topology  through  replication 
or  replacement  of  components.  In  this  paper,  we  address  a 
true  local  strategy,  namely  dynamic  resource  reservation 
adjustment.  In  the  context  of  multiple  applications,  resource 
reservations  [4]  are  a  means  to  manage  the  usage  of  shareable 
resources.  In  reservation-based  resource  management,  each 
component  of  every  application  in  need  of  a  particular 
shareable  resource  is  assigned  a  budget  for  that  resource. 
Such  a  budget  is  called  a  reservation  and  it  bounds  the 
utilization  of  the  resource  by  the  component. 

The  typical  goal  of  resource  reservation  adjustment  is  to 
make  sure  that  resource  provisions  exceed  resource  demands, 
but  only  with  a  minimal  amount.  Making  resource 
reservations  and  adjustments  presupposes  knowledge  of 
component  resource  demands.  When  component  demands 
are  fairly  stable  and  can  be  estimated  at  design  time, 
appropriate  resource  reservations  can  be  made  at  deployment 
time  that  last  for  the  entire  operation  time  of  the  application. 
Since  these  reservations  must  cater  for  the  highest  occurring 
demand,  they  are  necessarily  conservative.  In  systems  with 
highly  fluctuating  resource  demands,  static  resource 
reservation  is  therefore  not  suitable  because  it  typically  results 
in  a  too  large  waste  of  resources.  From  this  we  conclude  that, 
for  the  class  of  applications  we  are  interested  in,  reservations 
should  be  made  at  runtime  and  adjustment  should  take  place 
on  a  regular  basis.  A  potential  downside  is  that  this  leads  to 
'best  effort'  behavior  instead  of  'guaranteed'  behavior.  Ideally, 
a  component's  resource  reservation  should  be  such  that  it 
resembles  its  resource  utilization  profile  as  closely  as 
possible,  but  always  being  conservative.  This  means  that 
resource  reservation  should  be  based  on  correct  prediction 
of  resource  utilization. 

Contributions.  In  this  paper,  we  propose  a  resource  model 
and  reservation  adjustment  strategies  that  achieve  this  goal, 
in  the  following  way.  First,  we  introduce  a  state-based 
component  resource  model  where  resource  utilization  is 
classified  in  a  number  of  so-called  states.  Second,  we  describe 
our  monitoring  method  and  analyze  its  accuracy.  Third,  we 
perform  a  number  of  experiments  to  validate  the  monitoring 
accuracy  and  to  determine  proper  model  (state  size)  and 
monitoring  (sampling  period)  parameters.  Finally,  we  examine 
a  set  of  prediction  strategies  in  which  resource  reservation  is 
based  on  the  prediction  of  the  next  utilization  state  from  the 
current  one.  The  main  goal  of  this  is  to  illustrate  the 
effectiveness  of  the  model  which  is  expressed  by  two  quality 
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metrics  for  prediction:  Fail,  which  measures  how  often  an 
application  lacks  sufficient  resources  and  Waste,  which 
measures  over-provisioning  of  resources.  We  limit  attention 
in  this  work  to  the  processor  resource. 

The  organization  of  the  paper  is  as  follows.  In  Section  2 
we  discuss  related  work.  In  Section  3  we  present  the 
underlying  resource  model.  In  Section  4  we  describe  a  series 
of  experiments  to  validate  model  assumptions  and  to  determine 
the  effect  of  model  parameters  on  the  quality  of  the 
reservations  for  a  simple  prediction  rule.  In  Section  5  we 
conclude  with  a  summary  of  achieved  results  and  directions 
for  future  research. 

II .  Related  Work 

In  [5],  existing  approaches  for  model-based  software 
performance  prediction  are  classified  into  several  categories: 
queueing  network-based,  process-algebra-based,  Petri-nets- 
based,  simulation-based  and  stochastic  processes-based 
approaches.  In  this  work  we  look  at  resource  utilizations  using 
a  stochastic  approach.  In  [6]  and  [7]  the  CPU  resource  usage 
is  analyzed  and  modeled  with  the  purpose  of  runtime 
adaptation  and  reconfiguration  of  the  system.  The  work 
addresses  mainly  systems  with  architectural  particularities 
regarding  CPU  usage  (i.e.  CPU  throttling)  and  the  focus  is  on 
the  energy-awareness  of  the  systems.  The  goal  of  our  work 
is  on  achieving  QoS -aware  systems  through  runtime  resource 
management. 

In  [8],  [9]  and  [10]  the  resource  consumptions  of  a 
particular  class  of  medical  video  applications  (i.e. 
interventional  X-ray  applications)  are  investigated  and 
modeled  with  the  goal  of  scalable  and  cost-efficient  Quality 
of  Service  (QoS).  The  work  demonstrates  that  when  the 
resource  usage  of  tasks  is  purely  random  or  correlated  for 
only  short  periods  of  time  it  can  be  predicted  with  Markov 
Decision  Models  (MDM).  Another  approach  for  prediction 
based  on  MDM  and  for  a  particular  codec  (i.e.  MPEG-2)  is 
found  in  [11],  where  the  main  goal  is  to  maximize  the  quality 
of  the  output  as  perceived  by  the  user  by  managing  temporal 
load  fluctuations.  Our  work  addresses  video  applications  in 
a  more  generic  way  and  implies  different  runtime  prediction 
strategies. 

In  [12]  the  resource  consumptions  of  long-running 
applications  are  analyzed  and  modeled  based  on  their  history, 
with  the  goal  of  optimizing  the  runtime  resource  provisioning 
by  means  of  observed  resource  usage  patterns.  The  resource 
execution  states  are  achieved  by  using  a  clustering  algorithm. 
In  our  work,  states  are  predefined. In  [13],  [14]  and  [15]  a 
parameterized  component  resource  model  is  presented,  in 
which  resource  usage  is  captured  by  three  values 
representing  worst-case,  best-case  and  average-case  resource 
demand.  These  and  other  parameters  are  provided  by  the 
user  of  the  framework  and,  once  defined,  are  constant.  In 
[10],  a  similar  model  is  introduced  for  the  description  of 
component-based  software  architectures.  The  model  is 
designed  with  a  special  focus  on  the  design  time  prediction 
of  quality  attributes,  e.g.  performance  and  reliability. 
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In  both  [16],  [17]  and  [18],  the  implicit  assumption  is  that 
targeted  applications  are  such  that  resource  management  can 
be  treated  as  a  design-time  problem.  It  is  also  assumed  that 
the  designer  will  have  a  means  to  identify  correct  parameters. 

Our  component  resource  model  targets  applications 
characterized  by  highly  and  transiently  fluctuating  resource 
demands,  e.g.  video  applications.  Therefore,  our  resource 
model  needs  to  accommodate  resource  management  in  which 
resource  reservations  are  updated  at  runtime  from  predictions 
based  on  previous  resource  usage.  In  [19]  a  model  for  a 
specific  video  decoder  component,  viz.  H.264,  is  shown  that 
uses  key  metadata  from  the  video  stream  to  make  resource 
predictions.  However,  resource  prediction  based  on  key 
metadata  is  not  applicable  to  our  case  as  we  want  a  generic 
resource  prediction  that  will  be  correct  for  any  video 
processing  component,  or  any  component  for  that  matter. 
While  resource  usage  of  a  specific  video  decoding 
component  can  be  related  to  key  metadata,  in  the  general 
case  the  resource  usage  of  a  component  depends  largely  on 
the  purpose  of  the  component,  the  way  it  is  implemented,  the 
input  data,  and  the  internal  state  of  the  component. In  [20] 
similar  work  for  measuring  the  CPU  utilization  of  video 
applications  is  shown,  where  resource  monitoring  is  realized 
using  a  white-box  approach,  i.e.,  by  code  instrumentation  of 
the  components.  In  general,  a  white-box  monitoring  approach 
is  not  always  possible. 

Moreover,  periodic  sampling  of  multiple  components  on 
a  single  core  will  result  in  erroneous  information  in  the 
recorded  data.  Even  when  all  components  succeed  to  sample 
their  resource  usage  with  the  same  sampling  period,  the 
measurements  are  by  construction  out  of  phase,  since  there 
can  be  only  one  process  active  on  a  core  at  any  moment  in 
time.  In  this  work,  we  target  periodic  sampling  and  apply  a 
black-box  approach  to  simultaneously  trace  individual 
components  of  a  component-based  application  by  a  separate 
monitor. 

III.  Resource  Model 

In  this  section,  we  present  a  resource  model  that 
addresses  the  situation  in  which  a  single  component  C  uses 
a  single  shared  resource  R  for  an  indefinite  period  starting  at 
time  ts  . 

For  tj  <  tb  <  te,let  u(tb,  te)  denote  the  utilization 
of  resource  R  by  component  C  in  the  interval  [t^i  ^e  X  i-C-, 
the  fraction  of  R  used  by  C .  In  this  paper,  we  focus  on  CPU 
utilization,  so  the  utilization  is  the  fraction  of  the  interval 
\tb,  ("4,)during  which  the  CPU  executes  C .  In  general,  the 
utilization  depends  on  the  input  provided  to  C.  Since  we  aim 
at  a  runtime  model,  the  input  to  C  is  implicitly  understood  to 
be  the  current  one,  and  therefore  will  not  occur  as  a  separate 
parameter  in  our  model.Next,  let  r (("{,,  te)  be  the  fraction  of 
resource  R  reserved  for  component  C  in  the  interval  [tb,  £  ) 
Ideally,  reservations  should  satisfy  H(j  £  \  <  r(t  t  ) 
Reservation-based  resource  management  ror  a  component 
with  a  fluctuating  resource  demand  involves  the  repeated 
determination  of  reservations  r.  each  of  which  is  valid  for  an 
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FailCUNtXN)=j-]t{i\Q 


accompanying  reservation  period.  For  the  sake  of  simplicity, 
we  henceforth  assume  that  these  periods  are  equal  and  have 
duration Pril,  .So,  for   Q  <  i 

Let  tj  =  Jj.  +  i  .  Pr:ir  be  the  moments  at  which  these 
reservations  are  made,  and  let  ui  =  u(tj,£i+1)  and 
Tj  =  r(ti,  t;+1)  be  the  utilization  and  reservation  for  the 
/-th  interval  |"£.  £,  \  ,  respectively.  Collecting  utilizations 
and  reservations  over  N  intervals  results  in  a  utilization 
trajectory  1lN  =  {u^  |  0  <  i  <  /V]  and  a  reservation 
trajectory  !RN  =  {r;  |  0  <  I  <  N]  respectively. 
We  measure  the  suitability  of  a  reservation  trajectory(f(!>r  a 
utilization  trajectory  by  two  metrics.  The  first  metric  is  the 
failure  rate,  defined  as  the  fraction  of  the  intervals  for  which 
the  utilization  exceeds  the  reservation,  i.e., 

"  <i<NAri<  Utl(l) 

where  operator  "#"  denotes  the  cardinality  of  a  set.  The 
second  metric  is  the  (resource)  waste,  defined  as  the  average 
reservation  surplus,  i.e., 

W0Ste(UN,3lN)  =  -So£i<«  maxCO,^  -  Uj). 

Some  remarks  concerning  these  metrics  are  in  place.  First, 
a  positive  failure  rate  does  not  imply  that  an  application 
crashes.  In  practice,  it  is  more  likely  that  it  suffers  occasional 
and  transient  reductions  in  QoS.  Second,  a  positive  waste 
does  not  imply  that  the  CPU  is  idle  during  some  reservation 
periods.  An  application  may,  and  indeed  should,  release  the 
CPU  when  it  has  consumed  its  necessary  fraction  (£#,  Thus, 
it  will  give  a  scheduler  the  opportunity  to  allocate  the 
remaining  fraction  J^  —  li^  of  its  reservation  to  other 
applications,  e.g.,  those  that  execute  without  any  reservation 
such  as  cycle  scavengers.  Third,  any  reservation  strategy 
must  involve  a  trade-off  between  these  two  metricssince  they 
respond  conversely  to  changes  in  reservation. A  transition 
of  a  single  reservation  j-  fromj-j  <  Ut  to  rt  >  II  j  leads  to 
a  decrease  in  failure  rate  by  \f  N.  whereas  the  waste  incre- 
ases by  (r-  —  u  ■  )/JV  due  to  the  same  change.  Hence,  we 
are  also  interested  in  the  minimum  waste  W  Mn{'ll^  )  obtained 
by  any  reservation  trajectory  whose  failure  rate  does  not 
exceed  the  value    —  ,  defined  as 

WM^)   = 

m[waste(liN>^N)\Fail(JUN,7lN)<^] 

In  the  sequel,  we  investigate  reservation  strategies  in 
which  reservation  Jn  is  based  on  the  preceding  utilization 
trajectory Un  =  {u,  |  0  <  I  <  n\. which  lend  themselves 
to  run-time  reservation.  For  that  purpose,  we  first  limit  the 
amount  of  information  contained  in  a  utilization  trajectory  by 
dividing  the  utilization  range  [0, 1]  into  JI5  equal-sized  sub- 
ranges, to  which  we  shall  refer  as  (utilization)  states.  To  be 
precise,  for  any  utilization  value  u  e  [0, 1],  its  state  Qs(u}  is 
the  rank  of  the  utilization  interval  to  which  its  belongs,  i.e., 
we  define  cr5(0)=  land  ff$(u)  =  \nsll],  for  0  <  u  <  1 
So  Cs(h)ls  a  natural  number  satisfying  1  <  0"5(u)  <  ns. 
and  for  a^Owe  also  have 

(ers(n)  -  1)S  <  u  <  o-s(u)S, 
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where  S  is  the  state  size  defined  by  nsS  =  1 .  In  the  sequel, 
we  analyze  the  effect  of  state-based  reservation,  i.e.,  strategies 
whose  reservations  are  an  integral  multiple  of  the  state  size 
S.  Obviously,  the  state  size  influences  the  resulting  waste.  In 
particular,  it  is  an  immediate  consequence  of  (4)  that  the 
minimum  waste  at  zero  failure  rate  is  given  by 

mw0(un)  =^E0^w(>sK)s-iO  <  S.       (5) 

Recall  that  we  target  applications  with  highly  and  rapidly 
fluctuating  resource  utilization.  Therefore,  validity  of  (5) 
implicitly  assumes  that  the  reservation  period  is  such  that  a 
reservation  rn  =  O5  {lin)S  is  sufficient  to  cater  for  all 
fluctuations  within  the  n-th  period.  It  is  also  fairly  obvious 
that  the  failure  rate  of  state-based  reservation  is  related  to 
the  accuracy  of  strategies  to  predict  the  next  utilization  state. 
If  the  predictions  Sn  of  such  a  strategy  satisfy 
Frob(as(un)  >  sn)  <  e. 

for  some  predefined  small  value  a ,  then  the  expected  failure 
rate  is  at  mostf  when  we  choose  reservations  accordingly, 
i.e.,  rn  =  S^S. 

Because  we  want  to  predict  the  n-th  utilization  state  Un  from 
the  trajectory  ^n-,  we  are  interested  in  the  nature  of  state 
transitions  within  these  trajectories.  In  particular,  reasonable 
predictions  can  be  obtained,  when,  for  any  application  input, 
state  transitions  between  reservation  periods  are  mostly 
between  neiahborins  states. 

So,  for  1  <  a,b  <  ns.  lei  <p(atb)  ,let  denote  the  fract- 
ion of  all  state  transitions  in  l^n;hat  lead  from  a  to  b,  i.e., 

(p{atb)  = ,  w 

where  0"j(u_1)  =  1  by  definition.  Moreover,  for  —  ns  < 
k  <  rl^  let  f  denote  the  fraction  of  state  transitions  in 
which  the  resource  utilization  changes  by  more  than    feSi.e., 

fk  =  £Q+fc<i^(a,&).  (7) 

To  illustrate  the  usefulness  of  utilization  states,  we  analyze  a 
class  of  very  simple  reservation  strategies  for  CPU-usage. 
For  (J  <  k  <  n?  >  we  consider  a  strategy  that  produces 
the  reservation  trajectory  Ji     —  {t-  I  0  <|  I  <  H\    wim 

r;  =  min  ((a^u^)  +  k)S,  l).  (8) 

In  other  words,  the  reservation  for  a  period  is  based  on 
the  actual  (monitored)  utilization  in  the  previous  period  and 
the  "prediction"  that  the  utilization  state  will  increase  by  at 
most  k.  Of  course,  the  state  thus  predicted  should  not  exceed 
which  means  the  reservation  thus  made  should  not  exceed  1 . 
In  the  sequel  we  will  refer  to  k  as  the  prediction  parameter  of 
the  reservation  strategy.  Next,  let  T£vbe  a  utilization  traje- 
ctory and  let  JJ,v  be  the  corresponding  reservation  trajectory 
generated  by  strategy  k.  It  can  be  shown  that  the  associated 
failure  rate  is  given  by 

FaiI(HN^N)  =  fk.  (9) 

In  addition,  the  waste  can  be  bounded  in  terms  of  the  failure 
rate  and  the  state  transition  fractions.  First  note  that,  by  (4), 
we  have  ifj  >  (as(u^  —  1)5  and,  by  (8),  we  have 
J";   <  (os(Ui-i)  +  k)S  Application  of  these  bounds  to  (2) 

vcACEEE 


Full  Paper 


ACEEE  Int.  J.  on  Information  Technology,  Vol.  3,  No.  1,  March  2013 


gives  the  inequality 

Wa$te(tLnrMw)  <  W{k,$),  (10) 

where  the  bound  on  the  right  hand  side  is  given  by 
W(k,S)  = 

^o*l<n  max(Gl(as(ui_t)  +  k  -  (%C«t3  - 1)5) 

Next,  introducing  names  for  states  and  eliminating  terms 
with  value  zero,  the  right  hand  side  of  this  inequality  can  be 
rewritten  as 

Su+fcafi-Zo^i-cw  (_a-\-k-b  +  1)5. 

Since  the  terms  of  the  inner  summation  do  not  depend  on 
the  index  of  summation,  and  expressing  state  transitions  by 
summation  instead  of  counting,  i.e.,  using  the  equality 


<p(a,b)=  -T,Q,i<NAl 


instead  of  formula  (6),  we  obtain 

W(k,S)  =  £a+Jt^ <p(.a, b)(a+k-b  +  1)5.  (11) 

Obviously,  the  waste  depends  both  on  the  state  size  S  and 
on  the  prediction  parameter  k.  Moreover,  because  the  state 
transition  fractions  <p  (_(l,  b)  depend  on  the  reservation  period 
Prsv  ,so  does  the  waste. 

By  a  tedious  but  straightforward  computation  it  can  be 
shown  that 

W(k,S)  =  (1  -  fk)S  +  W(k  -  1,5).  (12) 

This  can  be  appreciated  as  follows. For  strategy  k,(\  —  f  ) 
is  the  fraction  of  reservation  periods  without  failure,  and 
therefore  exhibiting  some  amount  of  waste.  For  those  periods 
that  would  also  be  without  failure  at  strategy  k-1  ,the 
additional  waste  is  precisely  S,  for  the  others  at  most  S 

In  Section  5,  we  present  experiments  that  measure  the 
influence  of  k  on  the  failure  rate  and  reservation  waste. 
Furthermore,  we  investigate  the  robustness  of  the  model  in 
the  context  of  a  simple  video  application  (i.e.  a  video  player). 

IV.  Monitoring  Method 

In  the  previous  section  we  have  introduced  a  reservation 
strategy  based  on  observed  CPU  usage  in  the  past.  An 
implementation  of  such  a  strategy  requires  the  ability  to 
monitor  the  CPU  usage  of  an  application,  for  which  we  must 
rely  on  operating  system  primitives  with  limited  accuracy. 
Moreover,  utilization,  being  defined  as  a  relative  quantity, 
can  often  not  be  measured  directly  by  an  OS  primitive. 
Instead,  the  OS  usually  provides  a  pri-mitive  to  measure  the 
amount  of  work  done  by  the  hardware  (i.e.  CPU)  on  behalf  of 
a  software  component.  In  this  section,  we  work  out  the  details 
of  such  a  (time-based)  monitoring  method  for  an  operating 
system  from  the  Microsoft  Windows  family.  In  particular,  we 
consider  a  single  application  (containing  one  component) 
running  on  a  dedicated  CPU-core  whose  execution  starts  at 
time  *  0 .  So,  for  t  >  tn  ,  let  w  (f)  be  the  work  (measured  in 
time)  spent  by  that  core  on  A  since  t-Then,  the  utilization  by 
of  the  CPU  during  the  interval  [t^,  t)  is  given  by 

u(t)  =  w(t)/(t-ta)  W 
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Next,  assume  that  u(t)  is  sampled  with  a  fixed  period  "i77lp 
and  let   U.;  be  the  measured  utilization  in  period  \tit t ;+-,*). 
where  t,  =  t0  +  '^mp  Then 


",-^i)-"ft)=    lT  (14) 

.,,.,..  »„„[..„„.  |,„,vU    iTTip  may,  but  need  not,  be  the  same  as 
the  reservation  period  Prsv  of  the  resource  model. 

In  a  Windows  environment,  an  accurate  measurement  of 
the  work  is  obtained  using  the  Windows  Performance 
Counters  r?  1  ] .  Given  such  a  counter  PC  for  A,  we  obtain  an 
estimate  u;  for  «*i  ,  given  by 

%  =  (PC[i  +  1]  -PC[E-])/(ti+1  -it),  i15> 

where  PC[i]   is  the  value  of  PC  inspected  at  time   *i .  In 
practice,  the  accuracy  of  ui  is  determined  by  two  factors: 
i.  the  update  granularity,  i.e.  the  minimal  amount  by  which  the 
counter  is  incremented,  and 

ii.  the  inspection  time  jitter,  i.e.,  the  difference  between  the 
desired  time  of  inspection  tj  and  actual  time  of  inspection  f. 
For  the  sake  of  simplicity  we  will  assume  that  the  effects  of 
jitter  are  negligible,  i.e.,  we  will  assume  that  t;  ■=  f(  and 
consequently    ti  +  1  -t^  =  P^ 

Hence,  it  remains  10  determine  the  effect  of  the  update 
granularity  on  the  values  recorded  by  the  counter. 
So,  assume  an  update  grain  size  .  Then,  at  any  time  ,  the 
relation  between  the  counter  value  and  the  actual  work  will 
be  given  by 

PC(t)  =   [w(tyTu\Tu,  (16) 

i.e.,  the  value  of  the  counter  always  contains  the  largest 

multiple  of  the  update  grain  size  smaller  than  or  equal  to  the 

actual  work  performed  by  the  CPU.  So,  for  all  t  >  t-Q   ,  we 

have 

0  <  w(t)  -  PC(t)  <  Tu.  (17) 

An  immediate  consequence  of  this  counter  granularity  is 
that  it  imposes  a  lower  bound  on  the  state  size  S .  Let  lie  be 
the  maximum  number  of  counter  updates  during  a  sample 
period,  then  this  is  also  the  maximum  number  of  states  yis  that 
can  be  distinguished.  Hence,  we  obtain 


S  = 


>  —  = 


T"u     |'sm.p'~ul 


^£Dtp  Kjm.p/'iil 


(18) 


"i"  "C  [Psmp/rJ 
To  all  intents  and  purposes  we  replace  the  right  hand 
side  of  (18)  by  the  more  conservative  JL/Jemw  Next,  we 
determine  the  maximum  error  in  measured  utilization,  both 
per  sample  and  in  the  average  per  trajectory.  Using  formulae 
(14)  and  (15)  and  recalling  that  we  assume  absence  of  jitter, 
we  find  for  the  error  per  sampling  period 

_  \{pc(ti+j-pcito)-{witi+1)-wai})\    (19) 


Uj 


Li- 


1  JtKfJ 


(20) 


Combining  (17)  and  (19)  yields  the  upper  bound 

'smp 

Next,  consider  a  complete  trajectory.  In  the  absence  of  jitter, 
and  using  P£.'(£0)=  0  ,  it  follows  from  (15)  that 


i*0£i<:WUi  ~    ^OSi<N 


PC(.tt+t]-PCit£i    _    PCjtfj) 
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Using  u(tN)  =  w(tlV)//V/:,i-mp  and  inequality  (17)  again, 
we  obtain  tor  tne  error  in  tne  average  utilization  per  trajectory 

pcn^-wityA        r„(21) 


1W>« 


WP<- 


Observe,  that  the  latter  bound  is  inversely  proportional  both 
to  N,  the  number  of  samples  in  the  trajectory  and  NPsmp,  in 
case  of  a  video  application,  the  total  playing  time  .  Of  course, 
similar  bounds  hold  for  averages  taken  over  arbitrary 
consecutive  subtrajectories. 

V.  Experiments,  Results,  Discussion 

In  this  section  we  describe  the  experiments  that  evaluate 
the  state-based  resource  reservation  strategies  in  the 
following  way.  First,  we  describe  how  to  validate  the 
monitoring  method  used  for  collecting  the  samples.  Second, 
we  identify  the  proper  sampling  period  for  monitoring.  Third, 
we  describe  how  to  determine  an  appropriate  reservation 
period  for  conducting  these  experiments. 

The  software  component  used  in  the  experiments  is  a 
simple  video  player,  based  on  the  OpenCV  library  [21].  This 
OpenCV-based  video  player  executes  two  types  of  activities: 
decoding  a  video  stream  and  displaying  it.  These  activities 
are  performed  by  the  component  on  a  frame  by  frame  basis, 
yielding  an  alternating  sequence  of  decoding  and  displaying 
steps.  Furthermore,  during  the  experiments,  we  enable  a  video 
filtering  feature  of  the  video  component.  This  increases  the 
amount  of  work  and  thereby  yields  a  considerable  number  of 
occupied  utilization  states. 

The  hardware  platform  used  in  the  experiments  comprises 
a  desktop  PC  equipped  with  an  Intel®  Core™  i7  860  CPU 
model  [22],  running  at  2.80  GHz.  The  operating  system  is 
Windows  7.  This  operating  system  allows  monitoring  of  CPU 
usage  through  performance  counters  that  are  updated  with  a 
fixed  granularity  of  f  =  15.6  ms  ,  which  means  that  the 
counter  is  increased  whenever  the  CPU  has  spent  another 
amount  j>  of  time  on  the  associated  process.  To  avoid 
inspection  time  jitter,  we  execute  the  resource  monitor  that 
records  |*  on  a  separate  core,  using  busy  waiting  and 
assigning  real-time  priority  to  prevent  other  applications  from 
suspending  the  monitor  beyond  the  next  sample  time,  as  could 
happen  when  using  a  sleep(t)  primitive  to  let  the  monitor  idle 
until  sampling  is  due. 

A.  Monitor  Error  Estimation 

Consider  a  set  of  yU.N  1  <  j  <  JW  of  Mruns  on 
the  same  input,  and  let  AjjfZL.)  denote  the  difference 
between  the  maximum  and  the  minimum  of  measured 
utilization  averages  (Xosr^fl  ut    }/"  m  this  set- 

Furthermore,  let  the  error  coefficient  E  be  defined  by. 

AMiUfl)  =  E-(TjNPsmf>) 

Then,  according  to  (2 1 ),  for  any  two  runs  of  the  application 
on  the  same  input  the  measured  average  CPU-utilization 
should  differ  by  at  most    27'^/ NPim„ 

Hence,  in  theory,  should  stay  below  2,  but  in  practice  there 
will  be  some  deviation  due  to  model  abstractions  and  the  fact 
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that  the  work  per  run  may  vary  slightly.  Table  I  presents  the 
outcome  of  an  experiment  of  runs  with  fixed  playing  time,  but 
varying  sampling  periods.  It  can  be  observed  that  in  all  cases 
the  error  coefficient  is  as  expected.  Table  II  shows  the 
behavior  of  the  error  coefficient  when  the  playing  time 
increases.  Again  the  error  coefficient  is  almost  constant, 
showing  that  the  accuracy  of  the  measured  average  utilization 
indeed  scales  inversely  proportional  to  the  playing  time.  The 
results  of  the  experiments  presented  in  this  section  imply 
that  the  monitoring  method  is  valid  and  has  a  bounded  error 
behavior. 

Table  I.  Error  Estimation  With  Fixed  Playing  Time  But  Varying 
Sampling  Periods 


N 

p 

rm 

VC*U 

B 

**u 

1 530 

156  ms 

10 

0.4  e4 

1.28-4 

32 

1250 

312  ms 

20 

0.4-4 

ore  =4 

IS 

500 

780  ms 

50 

0.4  a4 

0.38-4 

10 

250 

1560  ms 

100 

0.4  -4 

0.38-4 

10 

125 

3120  ms 

200 

0.4  =4 

0.^6 -4 

IS 

50 

7800  ms 

500 

0.4  e4 

0.38-4 

1.0 

Table  II.  Error  Estimation  With  Varying  Playing  Time  And  Varying 
Sampling  Periods 


N 

pmT 

aw*™ 

Tv 

AvC^Q 

E 

SPm 

25 

1560  ms 

103 

39  s 

3.4  --3 

2.63-4 

0.7 

50 

7B0ms 

50 

35  s 

3.4s-3 

2.63  -4 

0.7 

100 

1560  ms 

135 

156  s 

3.1  --3 

1.7S  =-4 

1.3 

250 

1560  ms 

133 

390s 

3.4-4 

3.84  -4 

2.1 

i  000 

L  :53  ms 

133 

1560  s 

3.1  -4 

0.2flfr4 

25 

:::o 

3120  ms 

200 

3900  s 

3.4  =-5 

0.08-4 

2.0 

2500 

t  :63  ms 

100 

3933  s 

3.4--: 

0.09  -4 

23 

B.  Determination  of  Sampling  Period 

The  goal  in  this  section  is  to  establish  the  appropriate 
period  P  which  will  be  used  for  monitoring  the  components. 
In  order  to  do  this,  we  perform  an  experiment  that  consists  of 
sampling  the  resource  usage  of  a  video  component  with 
different  values  of  p  for  a  sufficiently  long  playing  time 
(65  minutes)  and  movie  length  (97500  frames  at  25fps  frame 
rate).  The  movie  used  in  the  experiment  was  xvid  encoded 
and  placed  in  an  avi  container.  The  results  of  the  experiment 
are  displayed  using  histograms.  Figure  1,  displays  the 
utilization  state  frequency  profile  of  a  measured  trajectory.  On 
the  X-axis  the  CPU  utilization  is  represented,  using  a  state 
size  of  5=0.05,  resulting  in  20  states.  On  the  Y-axis  the 
occurrence  frequency  (in  %)  of  a  state  in  the  trajectory  is 
given.  As  can  be  observed  by  the  gaps  between  the  occupied 
states,  a  sampling  period  of  156ms  is  too  small.  This  is 
consistent  with  formula  (14)  which  tells  us  that  at  this  sampling 
rate  only  10  states  can  be  discerned.  As  the  sampling  period 
is  increased,  the  profile  converges  to  a  profile  consisting  of  a 
single  occupied  state  that  represents  the  average  utilization 
over  the  entire  trajectory.  As  can  be  seen,  convergence  is 
almost  achieved  at  a  sampling  period  of  1560ms,  which  at  the 
given  frame  rate  of  25fps  corresponds  to  processing  39  frames. 
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Since  our  goal  is  to  provide  accurate  runtime  management  of 
resources  by  capturing  rapid  state  fluctuations,  however,  we 
should  aim  at  a  sampling  period  where  the  resource  usage 
profile  is  not  dominated  by  a  single  state.  Therefore,  the 
experiment  suggests  that  a  sampling  period  P  around  500ms 
is  appropriate. 

Furthermore,  it  does  not  make  sense  to  have  a  reser-vation 
period  P  smaller  than  the  period  at  which  we  monitor  the 
utilization.  Combination  of  this  observation  with  formula  (14) 
yields  the  inequality    n^   <  Psmp  <  f^. 


CPU  mvyc  iljk- 


&   !|    .-^  ■: 


Fig.  1.  The  utilization  state  frequency  profile  of  the  CPU  usage 

trajectory  of  OpenCV-based  video  player,  monitored  with  P       I 

{156ms,  312ms,  780ms,  1560ms,  3120ms,  7800ms) 

C.  Prediction  Strategy  Evaluation 

In  this  section  the  resource  reservation  strategies  defined 
in  (7)  are  evaluated  by  means  of  the  two  metrics  introduced 
in  Section  3:  failure  rate  Fail  (1)  and  resource  waste  Waste 
(2).  Since  these  metrics  depend  both  on  the  prediction 
parameter  k  and  the  state  size  S ,  this  leads  to  an  exploration 
of  the  (k,S  )  space.  From  (7)  it  follows  that  for  any  fixed  state 
size  S  the  failure  rate  is  decreasing  with  k,  i.e.,    /fc+J   <  [k. 

Similarly,  it  follows  from  (2)  and  (8)  that  the  resource  waste 
is  increasing  with  k.  Keeping  k  fixed  and  increasing  the  state 
size  S,  it  can  be  expected  that  a  similar  relationship  exists, 
because  making  reservations  in  larger  chunks  tends  to 
overshoot  the  required  resources  more  and  more  often, 
thereby  increasing  waste  and  reducing  failure  rate. 
Nevertheless,  there  is  no  mathematical  certainty. 

To  begin  with,  the  resource  usage  state  transition 
frequency  profile  of  the  monitored  component,  when  the 
sampling  period  was  set  to  Psmp  =  780ms  and  state  size  S  = 
0.02,  is  visualized  in  Figure  2. 

On  the  horizontal  axes  of  the  graph  the  previous  (a)  and 
the  current  (b)  resource  usage  state  of  the  monitored 
component  are  represented.  On  the  vertical  axis,  the  state 
transition  frequency  function  ^  is  plotted.  Observe,  that  all 
states  are  in  the  range  [25.. 36]  (of  a  total  of  50  states)  and 
that  most  of  the  state  transitions  are  clustered  around  the 
main  diagonal  a=b.  This  indicates  a  stable  resource  usage 
behavior,  where  transitions  are  mostly  between  neighboring 
states.  A  more  quantitative  illustration  of  this  fact  is  given  in 
Table  III,  where  the  values/k  are  used  to  compute  the  fractions 
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Fig.  2.  State  transition  frequency  profile  of  a  video  player;  P      — 
780m.v;  N=5000;  S  =  0.02 

the  state  transitions  that  show  an  increase  of  more  than  k 
states  (top  row),  a  decrease  of  more  than  k  states  (middle 
row),  or  at  most  k  states  in  either  direction  (bottom  row). 
Observe  that  the  entries  in  the  latter  row  rapidly  approach  1 . 

Next,  we  computed  the  quality  metrics  Fail  and  Waste,  in 
order  to  estimate  the  quality  of  the  resource  prediction 
method  introduced  in  this  work.  For  that  purpose,  we  applied 
a  series  of  strategies  (k,S),  with  reservation  parameter  k 
varying  over  the  set  and  usage  state  size  S  varying  over  the 
set  [0,02,  0,04.,  0,05.,  0,1}  to  the  same  utilization  trajectory 
that  gave  rise  to  Fig.  2.  The  outcome  of  that  computation  is 
shown  in  Table  IV.  In  addition,  we  determined  the  minimum 
waste  MW obtainable  for  each  strategy,  i.e.,  the  waste  given 
by  (3),  where  we  took  fl  =  Nfk  For  the  sake  of  clarity,  all 
quantities  in  Table  IV  are  given  as  a  percentage  rather  than 
as  a  fraction.  For  /;,  and  Wt  we  indeed  observe  the 
monotonicity  patterns  described  at  the  beginning  of  this 
section.  Moreover,  note  that  for  fixed  S  the  actual  waste 
behaves  almost  identical  to  its  upper  bound  as  given  by  (12). 

Finally,  note  that  grey  shaded  entries  represent  strategies 
that  are  dominated  by  other  ones.  In  this  case,  they  are  all 
dominated  by  the  entry  (k,s)  =  (4,4)  which  happens  to  have 
the  minimal  failure  percentage  of  0.02  %  (the  reservation  for 
the  first  period  is  always  too  small).  In  general,  strategies  can 
be  dominated  by  ones  that  use  a  smaller  state  size  as  illustrated 
by  the  next  example.  In  this  example,  we  consider  resource 
prediction  of  a  motion  detection  component  applied  with  the 
same  reservation  period,  and  applied  to  the  same  video,  as 
before.  For  this  component  the  state  transition  frequency 
profile  is  given  in  Fig.  3 

Table  III.  State  Transition  Fractions 
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Table  IV.  Fail  And  Waste  For  Various  Strategies  -  Video  Player; 
780ms;  N=5000 


Table  V  Fail  And  Waste  For  Various  Strategies  -  Motion  Detection 
Component;  P      =  780ms;  N=5000 


k 
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3.44 
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Fig.   3.   State  transition  frequency  profile  of  a  motion  detection 
component;  Psmp  =  780ms;  N=5000;  S  =  0.02 

Note  that  utilization  of  this  component  covers  far  more 
states  than  the  one  in  Fig.  2.  Nevertheless,  state  transitions 
are  once  again  clustered  along  the  main  diagonal.  In  addition, 
we  see  three  peaks  indicating  clusters  of  densely  populated 
states. The  Waste  and  Fail  statistics  of  the  various  strategies 
for  the  motion  detection  component  are  given  in  Table  V. 
Besides  the  monotonicity  properties  already  described  before, 
we  now  also  see  more  interesting  dominance  patterns. 
Strategy  (k,s)  =  (0,5)  is  dominated,  i.e.  lower  failure  rate  and 
lower  waste,  by  (1 ,2)  and  (1 ,5)  is  dominated  by  (0,3).  All  other 
points  are  Pareto  points,  showing  that,  in  general,  there  will 
be  a  trade-off  between  resource  waste  and  failure  rate, 
depending  on  the  QoS  desired  for  the  application. 

Conclusions  and  Future  Research 

This  work  introduces  a  state-based  resource  model 
suitable  for  making,  at  runtime,  resource  reservations  based 
on  resource  usage  prediction.  The  model  includes  two  metrics, 
namely  Fail  and  Waste,  which  evaluate  the  quality  of  a 
reservation  strategy. The  effectiveness  of  our  model  is 
illustrated  by  a  set  of  simple  prediction  strategies  as  a  function 
of  two  parameters,  utilization  state  size  and  the  state  change 
in  subsequent  reservation  periods.  The  first  parameter  serves 
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9.14 
8.65 
0.00 

1.54 
17.18 
1.04 

0.70 

2199 

193 

0.02 
44.06 
4.15 

as  the  unit  of  reservation  and  the  latter  captures  the  maximum 
change  in  utilization  accommodated  by  the  reservations. 
Insights  for  making  a  proper  choice  for  these  parameters,  in 
order  to  obtain  an  acceptable  balance  between  the  obtained 
results  for  the  two  resource  prediction  quality  metrics,  are 
revealed  by  exploring  the  parameter  space. 
Our  future  research  involves  more  advanced  prediction 
strategies  based  on  prediction  algorithms  with  higher 
complexity.  In  particular,  the  presence  of  clustered  states, 
with  state  transitions  occurring  mainly  between  clusters,  gives 
rise  to  the  more  sophisticated  concept  of  runtime  resource 
usage  modes  [23].  Since  sojourn  times  in  modes  are  expected 
to  be  longer  than  in  individual  states,  prediction  of  modes 
should  be  more  accurate  than  prediction  of  states,  and 
therefore,  yield  improved  resource  management  with  respect 
to  Fail  and  Waste. 
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